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SHAPE  DESCRIPTION  VIA  THE  USE  Of  CRITICAL  POINTS* 


Herbert  Freemen 

Rensselaer  Polytechnic  Institute 
Troy,  Nev  York  12181 


Suimary 

A key  element  in  pattern  recognition  is  the  des- 
cription of  shape.  For  tvo-dlmensional  objects 
(blobs),  shape  is  conveyed  by  the  curving  of  the  bound- 
ary line  and  is  normally  considered  Independent  of 
scale  and  orientation.  The  curving  may  be  regarded  as 
a concatentation  of  arcs  of  varying  instantaneous  radii 
of  curvature,  possibly  interspersed  occasionally  by 
discontinuities.  The  description  of  shape  is  facili- 
tated by  segmenting  the  boundary  line  at  so-called 
critical  points  - corners  (discontinuities  in  curva- 
ture), points  of  inflection,  and  curvature  maxima. 
Additional  critical  points  are  intersections  and 
points  of  taagency.  Algorithms  are  described  for  ex- 
tracting such  critical  points  in  the  presence  of  noise. 
An  illustration  is  given  shoving  hov  the  critical 
points  may  be  used  in  the  developnent  of  a shape  des- 
cription system. 

1.  Introduction 

The  recognition  of  tvo-dlmensional  spatial  pat- 
terns is  intimately  linked  vith  the  classification  of 
shape.  Shape  is  the  composite  effect  of  curvature;  it 
is  a property  of  a curve,  independent  of  scale  and 
orientation,  and  refers  to  the  manner  in  vhich  the 
curve- deviates  from  seme  norm,  such  as  a straight  line 
or  a circle.  We  shall  use  the  terms  "line"  and  "curve" 
interchangeably;  a line  is  not  presumed  to  be  straight 
unless  explicitly  so  identified.  Neither  must  a curve 
be  non-straight.  The  kinds  of  curve  topologies  of 
Interest  to  us  are  shovn  in  Fig.  1. 

A closed  curve  traces  a single  path  without  end. 

An  open  curve  has  two  distinct  end  points.  A closed 
curve  is  always  considered  to  be  simply  connected;  a 
multiply-connected  curve  is  regarded  as  a set  of  tvo 
or  more  distinct,  closed  curves,  one  of  vhich  encloses 
all  others.  Both  open  and  closed  curves  may  intersect 
other  open  or  closed  curves,  as  veil  as  themselves. 

A closed  curve  has  a definite  sense  of  direction, 
vhich  is  such  as  to  place  the  interior  toward  the 
right.  For  an  open  curve,  the  direction  is  arbi- 
trary. 

The  points  at  which  curves  Intersect  or  touch 
will  be  called  Junctions.  Junctions  are  characterized 
by  their  radiality,  which  is  the  count  of  the  number 
of  lines  entering  the  Junction.  We  speak  of  trlradlal, 
quadradlal,  qulntradial,  etc.  Junctions.  Except  in 
very  rare  cases,  pictures  of  natural  processes  exhibit 
only  trlradlal  Junctions.  The  existence  of  a quad- 
radial  Junction  is  almost  always  evidence  of  a man- 
made feature.  Junctions  of  radiality  greater  than 
four  are  most  rare. 

At  a Junction  of  radiality  n,  pairs  of  adjacent 
curves  will  form  n distinct,  non-crossing  paths 
through  the  junction.  At  least  n-2  of  these  paths 
must  exhibit  a slope  discontinuity  (corner)  at  the 
Junction. 

We  shall  regard  all  two-dimensional  patterns  as 
occurring  in  the  form  of  maps.  Maps  are  special  kinds 
of  line  drawings  that  occur  in  the  following  four 

* This  work  was  supported  by  the  Air  Force  Office  of 
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types  - blob  maps,  contour  maps,  flow  maps,  and  region 
maps.  The  four  types  are  illustrated  in  Fig.  2. 
Different  types  may  be  overlaid  to  form  composite  maps. 

Blob  maps  are  maps  containing  only  closed  curves, 
none  of  vhich  may  contain  another  closed  curve.  Seme 
of  the  curves  may  be  closed  via  the  map  boundary.  The 
closed  curves  (vhich  each  bound  a blob)  may  touch  but 
not  intersect  each  other.  There  are  no  end  points. 

Contour  maps  are  like  blob  maps  except  that  one 
closed  curve  may  contain  one  or  more  other  closed 
curves.  Two  curves  may  be  tangent  to  each  other  pro- 
vided one  of  the  curves  is  contained  within  the  other. 
There  are  no  end  points. 

Flow  maps  contain  only  open  curves  and  these  are 
connected  to  form  one  or  more  tree  structures.  There 
are  no  loops.  All  curves  terminate  either  at  Junc- 
tions or  at  end  points.  For  each  connected  tree 
structure,  precisely  one  end  point  coincides  with  the 
map  boundary. 

In  region  maps,  the  space  is  divided  into  mutu- 
ally exclusive  areas.  All  lines  begin  and  terminate 
at  Junctions  or  are  closed  (rare).  There  are  no  end 
points. 

We  identify  three  kinds  of  problem  tasks  that 
Involve  shape  - "matching",  "fitting",  and  "classify- 
ing". In  "matching"  ve  desire  to  establish  equiva- 
lence (up  to  a certain  level  of  precision)  between 
tvo  curves  vhich  may  be  open  or  closed.  The  equiva- 
lence may  be  on  the  basis  of  (1)  shape  alone, 

(2)  shape  and  scale,  (3)  shape  and  orientation,  or 
(U)  shape,  scale,  and  orientation. 

In  "fitting”  ve  must  establish  that  tvo  open 
curves  (vhich  are  usually  parts  of  tvo  different 
closed  curves)  are  equivalent  in  scale  and  complement- 
ary in  shape.  Tvo  curves  are  said  to  be  complementary 
in  shape  if  they  are  geometrically  similar  but  are 
traced  out  in  the  opposite  sense.  (The  direction  of 
tracing  is  determined  by  the  area  bexinded,  vhich  by 
convention,  is  always  assumed  to  lie  toward  the 
right. ) 

"Classifying"  refers  to  the  process  of  assigning 
given  curves  to  predetermined  categories.  The  assign- 
ment may  be  based  solely  on  shape,  or  it  may  involve 
scale  and  orientation  as  veil. 

2.  Shape  Description 

One  of  the  difficulties  in  any  shape  processing 
task  is  the  lack  of  a definitive  way  of  describing 
shape.  Many  investigators  have  devoted  attention  to 
this  task  and  a variety  of  approaches  have  been 

proposed1-^.  We  shall  follow  a method  originally  pro- 
posed by  Attneave  and  Arnoult  and  divide  a curve  into 
accents  and  then  use  relatively  simple  features  for 
characterizing  the  segments.  Key  to  this  method  is  an 
effective  segmentation  scheme.  Later  ve  shall  also 
consider  a scheme,  applicable  to  closed  curves  only, 
that  does  not  require  segmentation . 

In  all  that  follows  it  will  be  assumed  that  the 
curves  have  been  quantized  into  a square  lattice  of 

a 
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■nd  than  encoded  in  tarns  of  tha  S-direetion  chain  , 

. 2 

coda  . 


Ua  shall  first  examine  tha  shape  matching  problem. 
A triad  and  effective  technique  for  matching  tvo 
chains  (open  or  closed)  is  the  chain  correlation 
schema,  so  called  because  of  its  analogy  to  eonven- 
9 

tlonal  signal  correlation . It  is  given  by 

♦ab(j)“n  jico,(VV,)l,/1‘ 

vhere  the  a^  and  b^  are,  raspactlvely,  the  ordered 

links  in  the  tvo  chains.  Chain  correlation  has  the 
dravback,  however,  of  being  rigidly  dependent  on  scale 
and  orientation.  To  use  it  for  matching  the  shape  of 
tvo  chains  of  different  scale  and  orientation,  one  of 
the  chains  must  first  be  rotated  and  scaled  to  bring 
it  into  alignment  vlth  the  other.  To  determine  the 
amounts  of  rotation  and  scaling  required,  some  further 
information  about  the  tvo  chains  must  be  available. 

To  take  a very  simple  example,  suppose  tvo  open 
chains  are  to  be  checked  for  a shape  match,  independ- 
ently of  scale  and  orientation.  One  can  connect  the 
tvo  end  points  of  each  chain  vlth  a straight  line  seg- 
ment. The  relative  lengths  of  the  tvo  segments  then 
provides  a basis  for  adjusting  the  scale  of  one  chain 
to  that  of  the  other;  the  angle  betveen  the  tvo  seg- 
ments establishes  the  amount  of  rotation  required  to 
bring  the  tvo  chains  into  alignment.  (Hote  that  such 
scaling  and  rotating  Introduces  additional  quantiza- 
tion noise.)  After  the  scale  and  orientation  adjust- 
ment, the  tvo  chains  can  be  correlated  against  each 
other  to  determine  the  exact  degree  of  match. 

The  simple  schesM  Just  described  has  serious  defi- 
ciencies. First  of  all,  it  will  not  work  for  closed 
curves.  Second,  the  scheme  is  overly  dependent  on  the 
precise  locations  of  the  end  points.  Relatively 
slight  variations  here  vlll  cause  erroneous  scale  and 
orientation  normalizations,  and  result  in  failure  of 
the  correlation  test,  even  for  chain  pairs  that  are 
otherwise  very  similar  in  shape.  Finally,  the  scheme 
is  Inapplicable  to  the  "fitting”  problem,  vhere  a 
section  of  one  chain  is  to  be  shown  to  be  of  coa^le- 
mentary  shape  to  a section  of  another  chain  - without 
any  a priori  knowledge  of  orientation  (and,  possibly, 
scale  as  veil).  We  shall  now  develop  a mare  powerful 
method  which  overcomes  these  difficulties. 

Even  if  ve  expect  tvo  curves  to  match  in  their 
entirety,  it  would  be  unwise  to  test  them  for  match 
over  their  full  length  at  once.  Part  of  one  chain  may 
be  missing  or  severely  distorted  by  noise,  or  perhaps 
a part  may  have  been  expressed.  Or  a chain  may  have 
been  bent,  placing  its  tvo  parts  at  a wrong  angle 
relative  to  each  other.  Although  such  effects  clearly 
reduce  the  quality  of  a match,  they  should  not  be 
permitted  to  overwhelm  evidence  of  good  matching  in 
other  parts  of  the  chains.  The  only  solution  to  this 
is  to  segment  the  chains,  and  to  proceed  with  the 
match  on  a sectlon-by-eectlon  basis. 

If  the  chains  to  be  matched  are  suppoeed  to  have 
the  same  scale  sad  orientation,  and  cross-correlation 
is  to  be  used,  segmentation  presents  no  problem. 

Since  in  the  correlation  process  one  chain  (or  chain 
section)  is  anyway  shifted  relative  to  the  other, 
selection  of  the  segmentation  points  is  virtually 
arbitrary.  A totally  different  approach,  howevar,  is 
called  for  if  the  relative  scale  and  orientation  of 
the  tvo  chains  are  not  a priori  known. 


3.  Critical  Points 

If  ve  wish  to  compare  tvo  chains  Independently  of 
their  relative  scale  and  orientation,  ve  must  find 
features  in  them  that  are  Invariant  under  these  para- 
meters. For  this  purpose  ve  shall  call  upon  the  so- 
called  "critical  points"  of  the  chains. 

In  classical  geometry,  the  "critical  points"  of 
a curve  are  its  maxima,  minima,  and  points  of  inflec- 
tion. Our  Interest  here  in  curves  is  more  far- 
reaching,  and  ve  shall  expand  the  concept  of  critical 
points  to  Include  also  discontinuities  in  curvature, 
end  points,  intersections  (Junctions),  and  points  of 
tangency.  To  a varying  degree,  these  points  are  all 
vell-deflned  and  their  character  is  unaffected  by  the 
transformations  of  scale  and  rotation. 

let  us  consider  the  tvo  curves  sbovn  in  Fig.  3. 
Visual  Inspection  shows  them  to  be  of  apparent  similar 
shape,  though,  of  difference  size,  different  orienta- 
tion, and  different  sense  of  direction.  We  note  that 
each  curve  has  two  fairly  sharp  discontinuities  in 
curvature  (DICs).  If  the  tvo  curves  are  Indeed  of  the 
same  shape  and  there  is  a pairwise  correspondence  be- 
tveen the  DICs,  then  there  must  be  point-for-point 
shape  correspondence  in  the  sections  betveen  the  DICs. 
And,  of  course,  all  shape-related  features  that  are 
independent  of  scale  and  orientation  will  be  the  same 
for  both. 

We  first  connect  the  tvo  DICs  in  each  curve  vlth 
a straight  line  segment  and  determine  its  length. 
Suitable  features  to  be  computed  next  are  (1)  the 
length  of  the  curve  section  betveen  the  DICs,  (2)  the 
total  "bay"  area  lying  betveen  the  curve  section  and 
the  straight  line  segment  (the  "bays”  lie  tovard  the 
left  of  the  curve) , (3)  the  total  "peninsula"  area 
(lying  tovard  the  right  of  the  curve),  (U)  the  maximum 
"bay”  depth,  and  (5)  the  maximum  "peninsula"  depth. 

All  these  features  are,  of  course,  independent  of 
orientation.  To  make  them  also  Independent  of  scale, 
ve  divide  each  linear  feature  by  the  segment  distance 

MB  — — £ 

AS,  and  each  area  feature,  by  AS  . The  features  are 
illustrated  in  Fig.  3. 

Simple,  efficient  algorithms  exist  for  computing 
the  foregoing  features  if  the  curves  are  represented 

in  the  form  of  chains^®.  Observe  that  if  for  tvo 
curve  sections  the  features  are  pairwise  in  close 
agreement,  then  ve  have  strong  indication  that  the 
curves  are  of  the  same  shape;  though,  of  course,  a 
correlation  or  other  point-by-point  check  is  needed  to 
establish  this  for  certain.  Conversely,  any  major 
discrepancy  in  a feature  is  indicative  of  a difference 
in  shape.  If  for  tvo  chains  all  features  match  except 
that  the  bay  features  of  one  match  to  the  peninsula 
features  of  the  other,  then  the  tvo  chains  are  likely 
to  be  of  camplnentary  shape  and  thus  potential  candi- 
dates far  fitting. 

In  the  foregoing  example  ve  selected  DICs  to 
define  curve  sections  for  matching.  Of  all  the  criti- 
cal points,  DICs  (as  veil  as  Junctions)  are  the  most 
sharply  and  most  reliably  defined.  End  points  are 
also  sharply  defined;  however,  their  location  is  less 
dependable  sinee  occlusion  of  part  of  a curve  will 
cause  a false  end  point  to  be  generated.  The  next 
best  critical  points  to  use  (if  DICs  and  Junctions  are 
not  available) are  points  of  inflection  and  points  of 
maxlmm  curvature  (maxima  or  minima) . We  shall  dis- 
cuss these  further  in  the  next  section  in  connection 
with  the  determination  of  critical  points. 
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precisely  »»1  links.  This  Is  resdily  sppsrent  fro 


Junctions  end  end  points  require  no  speclel 
effort  for  their  determination;  they  ere  available 
directly  from  the  chain  date  structure.  Discontinui- 
ties In  curvature,  however.  Bust  he  extracted  by  com- 
putation. A sc heei e that  has  been  found  to  be  effec- 
tive for  this  purpose  Is  the  line  sesaent  scan11, 
described  as  follows. 

He  consider  s chain  {a. }n,  s e{0 7),  and 

1 . 

define  a straight  line  segment  that  connects  the 
the  lnltlum  of  a1_i+1  to  the  terminus  of  a^.  The 
length  of  L*  can  vary  from  a maximum  of  s/f  to  a min- 

1 mm  of  1;  however,  for  a well-quantized  curve.  It 
should  never  be  less  than  s/2.  The  actual  length  of 

L*  is,  of  course,  given  by 

tj  • t(xj)2  ♦ (rj)2]1/2 

where  if  - l a 

1 J*i-e*l 

v*  f 

ri  “ £ 4ly 

1 Jel-S+1 

and  where  the  a^  and  aly  are,  respectively,  the  x and 
y component*  of  the  chain  links,  a^,  *ly  t{-l,0,l). 

The  angle  between  the  x axis  and  the  forward 
direction  of  the  line  segaent  is 

ej  - tan*1  fj/xj  If  |X*|  > |XJ| 

- cot"1  Xj/Tj  if  |xj|  < |T*| 

and  can  vary  from  0 to  360  degrees.  Of  particular 

Interest  is  the  variation  in  this  angle  as  L*  scans 

over  the  chain.  lote  that  for  a closed  chain, 
a,  ■ a.  We  define 


an  Inspection  of  Fig. 


The  sun  of  the  4 . values 


over  the  s+1  steps  at  a discontinuity  is  given  by 
1 


i 


0*  ♦ 0®  - 0* 
ifl  1 i-s-1 


C.-2 


and  Is  equal  to  twice  the  net  angular  change  at  the 
discontinuity. 

The  detection  of  curvature  discontinuity  in  the 
presence  of  noise  Is,  of  course,  a sanevhat  subjective 
process.  A chain  Is  a special  kind  of  straight-line 
approximation  of  a curve  in  which  all  segments  are  of 
lengths  1 or  Vz,  and  all  angular  changes  arc  multiples 
of  U5  degrees.  To  get  above  the  chain  quantization 
noise  level,  a value  of  s of  at  least  it  is  required. 
Heavier  smoothing  (s>U)  is  desirable,  but  at  some 
Increased  value,  valid  curvature  discontinuities  will 
be  smoothed  over.  In  the  context  of  a chain  represen- 
tation (where  true  curvature  continuity  does  not 
exist),  a discontinuity  in  curvature  can  only  be  de- 
fined as  the  combination  of  three  concatenated 

sequences  of  4*  values  - two  during  which  4®  varies 

about  a small  mean  value  (or  about  zero),  separated  by 
one  precisely  of  length  s+1  during  which  It  differs 
significantly  from  zero  and  accumulates  a substantial 

total  variation  ♦*.  In  Fig.  U(b),  this  condition  1s 
satisfied  In  the  region  i*19  to  1*21*.  For  this 
region  the  accumulated  Incremental  curvature 

equals  189.2,  indicating  a net  CCV  curvature  of  9**. 6 
degrees.  The  curvature  discontinuity  is  clearly 
evident;  its  location  is  fixed  at  the  value  of  1 Just 

preceding  the  first  significant  value  of  4®,  that  is, 
at  1*18. 

In  general,  we  shall  refer  to  the  prominence  of 
a curvature  discontinuity  as  its  "cornerity"  and 

assign  the  following  quantitative  measure:11 


as  the  incremental  curvature  of  the  chain;  It  is 
equal  to  twice  the  mean  over  two  successive  angular 
differences.  The  incremental  curvature  is  a smoothed 
measure  of  curvature ; the  greater  s,  the  heavier  the 

a 

smoothing.  For  a well -quantized  curve,  s will  range 
normally  from  a minimum  of  5 to  a maximum  of  about  13. 
This  appears  to  be  the  optlmim  range  for  removing  the 
effect  of  chain  quantization  noise  and  yet  preserving 
significant  fine  detail.  An  illustration  of  line- 
segnent  scanning  Is  given  in  Fig.  It. 

A plot  of  4*  for  a given  chain-encoded  curve  pro- 
vides Insight  Into  the  shape  of  the  curve.  The  plot 
la,  of  course.  Independent  of  the  orientation  of  the 
curve,  and,  if  the  abscissa  Is  normalized,  xan  be  made 
Independent  of  seale  as  well.  For  a fairly  straight 

section  of  a curve,  4®  will  hover  about  zero.  For  a 
gentle  curving  to  the  left  or  right,  4*  will  maintain 
a small  positive  or  negative  value,  respectively.  At 
a point  of  Inflection,  4*  will  change  sign.  At  a 
significant  curvature  discontinuity,  4*  will  take  on 

a succession  of  relatively  large  values,  all  of  the 

same  sign,  and  all  compressed  into  a span  of 
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where  t^  and  tg  are  the  lengths  of  the  4*  sequences, 
to  either  side  of  the  discontinuity,  over  which  4* 

remains  at  a small  mean  value. 

For  the  purpose  of  shape  matching  (or  fitting) 
using  scale-  and  orientation-invariant  features,  only 
discontinuities  with  large  cornerity  values  should  be 
used.  Use  of  discontinuities  with  small  cornerity 
values  may  lead  to  erroneous  segmentation. 

examination  of  Fig.  lk(b)  also  shows  that  4^ 

changes  sign  at  1*12,  indicating  the  presence  of  a 
point  of  inflection.  The  actual  location  of  the 
point  of  Inflection  Is,  of  course,  s/2  positions 

earlier  since  4*  Is  labelled  according  to  its  leading 

node.  In  the  example  shown,  with  s*3,  we  place  the 
point  of  inflection  at  i*10. 

If  the  point  of  Inflection  is  prominent,  that 
is.  If  there  is  a clear  and  abrupt  shift  of  the  radius 
of  curvature  frem  one  side  of  the  chain  to  the  other, 
the  preceding  method  will  locate  it.  lot  infrequent- 
ly, however,  a curve  changes  from  convex  to  concave 
curvature  in  a slow  and  meandering  fashion.  Then  we 
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cannot  determine  tha  point  of  lnflaction  In  this  way. 
If  tha  tvo  rations  of  opposing  curratura  (for  which  wa 
ara  seeking  tha  point  of  lnflaction)  a re  both  pro- 
longed and  strong,  wa  oay  find  their  maxima.  Ve  then 
cannect  these  two  points  of  maximum  curratura  with  a 
straight  line  and  taka  its  intersection  with  the 
chain,  lying  between  tha  two  regions  of  curratura,  as 
tha  point  of  inflection.  If  tha  two  curratura  regions 
ara  not  sufficiently  strong,  or  ara  spaced  too  far 
apart,  the  point  of  inflection  cannot  be  reliably 
determined  and  we  must  seek  other  critical  points 
elsewhere. 

5 ■ Centroldal  Profile 

An  alternate  schesM  for  representing  the  shape  of 
a blob  in  an  orientation-  and  scale-invariant  manner 
is  provided  by  the  centroldal  profile.  It  is  appli- 
cable only  to  closed  curves  and  does  not  involve  any 
segmentation.  The  centroldal  profile  is  simply  a 
normalized  plot  of  the  distance  from  the  boundary  to 
the  centroid  of  the  blob,  as  a function  of  distance 
along  the  boundary.  Since  the  centroid's  coordinates 
are  determined  by  the  ratio  of  first-order  moments  to 
enclosed  area,  they  are  relatively  insensitive  to 
noise,  and  the  centroid  thus  tends  to  be  a stable 
point  of  reference  for  the  blob.  It  a blob  is  des- 
cribed in  the  form  of  a chain,  its  centroid  can  be 
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rapidly  computed  by  means  of  a simple  algorithm 

To  obtain  the  centroldal  profile  for  a chain- 
coded  blob,  one  first  computes  the  location  of  the 
centroid.  Next  one  determines  the  distances  from  the 
centroid  to  each  chain  node  in  sequence,  beginning 
with  the  Initium  of  the  chain.  The  Euclidean  distance 
measure  could  be  used;  however,  a simpler  and  equally 
effective  measure  for  the  purpose  at  hand  is  provided 
by  the  residue-chain  length.  A residue  chain  between 
two  points  is  the  standardized  chain  of  minimum  length 

connecting  the  two  points10.  The  computation  of  resi- 
due-chain length  involves  only  the  operations  of  addi- 
tion and  branching,  and  because  of  its  simplicity  is 
the  preferred  distance  measure  for  the  centroldal  pro- 
file. Its  use  does  require,  however,  that  the 
coordinates  of  the  centroid  he  quantized  to  the 
nearest  grid  nods,  thereby  slightly  increasing  the 
quantization  noise  in  the  blob  description. 

For  a closed  chain,  the  choice  of  the  initium  is 
normally  arbitrary.  To  provide  seme  uniformity  in  the 
selection  of  the  origin  of  the  centroldal  profile,  the 
initium  is  cyclically  shifted  to  the  chain  node  maxi- 
mally distant  from  ths  centroid.  If  the  maximum  is 
not  unique,  any  of  the  maxima  may  be  choaen.  This 
shifting  of  the  initium  provides  some  orientation  in- 
variance for  the  centroldal  profile.  However,  if 
there  is  more  than  one  maximum,  there  is  an  ambiguity 
of  order  equal  to  the  number  of  maximal 1 y -distant 
chain  nodes.  This  is  only  a minor  drawback,  as  we 
shall  see  shortly. 

The  dependence  on  scale  can  be  removed  by  divid- 
ing all  profile  values  by  the  maximum  value.  The 
actual  maxi  miss  value  will  be  called  the  vertical 
norma 11 s at 1 on  factor.  Every  centroldal  profile  thus 
will  have  a value  of  unity  at  the  origin  and  be  leas 
than  or  equal  to  unity  elsewhere. 

One  additional  normalisation  step  remains.  There 
is  no  fixed  rile  concerning  the  number  of  chain  links 
that  ara  needed  to  describe  a given  blob.  Ouidelinea 
exist  for  selecting  the  quantization  grid  fineness 
that  are  based  on  the  value  of  the  minimum  radius  of 
curvature  that  is  to  be  preserved  in  the  resulting 

ehain^.  However,  the  selection  of  the  nalleat  radius 


of  curvature  is  subjective,  and  a shape  comparison 
scheme  should  not  be  too  rigidly  dependent  on  such  a 
loose  criterion.  Accordingly  we  must  also  normalize 
the  abscissa  of  the  centroldal  profile.  This  is  done 
by  setting  the  maximum  abscissa  value  - corresponding 
to  one  complete  trace  around  the  blob  contoui  (or, 
equivalently,  one  pass  over  the  complete  chain)  - 
arbitrarily  to  100.  All  other  values  are  then 
adjusted  in  proportion.  The  factor  required  to  yield 
the  uniform  abscissa  scale  will  be  called  the  horizon- 
tal normalization  factor.  A centroldal  profile, 
together  with  the  blob  whose  shape  it  represents,  is 
shown  in  Fig.  5. 

For  the  blob  of  Fig.  5,  the  vector  from  the  cen- 
troid to  successive  chain  nodes  rotates  uniformly 
about  the  centroid  in  the  clockwise  sense.  This  will, 
of  course,  not  be  the  case  in  general.  To  avoid  pos- 
sible ambiguities  in  the  Interpretation  of  a centrold- 
al profile,  we  shall  take  profile  values  corresponding 
to  clockwise  angular  rotation  as  being  positive,  and 
those  corresponding  to  counterclockwise  rotation,  as 
being  negative.  An  Illustration  of  this  is  provided 
by  Fig.  6. 

The  centroidsii  profile  represents  the  blob  with- 
out regard  to  scale,  orientation,  and  (in  a limited 
sense)  quantization  fineness.  In  effect  only  the 
shape  remains.  Note  that  the  profile  generation  pro- 
cess is  reversible;  if  the  origin  shift  and  the  verti- 
cal and  horizontal  normalization  factors  are  known,  it 
is  possible  to  reconstruct  the  blob  chain  from  its 
centroidal  profile. 

Two  blobs  may  be  compared  as  to  their  shape  by 
comparing  their  centroldal  profiles.  If  both  profiles 
have  a single,  well-defined  maximum  (which,  of  course, 
will  be  at  the  origin),  the  comparison  is  straight- 
forward. Suitable  measures  of  shape  dissimilarity  are 
the  integral  of  the  differences  between  the  tvo  pro- 
file functions  and  the  integral  of  the  squared  differ- 
ences. 

If  one  or  both  of  the  profiles  have  a maximum 
that  is  not  veil  defined,  or  have  multiple  maxima,  a 
better  means  for  comparison  is  provided  by  convention- 
al signal  cross-correlation.  This  will  yield  a meas- 
ure of  the  similarity  as  well  as  identify  possible 
errors  in  the  origin  selection. 

It  should  be  pointed  out  that  the  abscissa  of  the 
centroldal  profile  represents  the  (normalized)  link- 
index  parameter  1 of  the  boundary  chain.  It  does  not 
represent  uniform  angular  rotation  about  the  centroid; 
nor,  in  fact,  is  it  a precise  measure  of  distance 
along  the  boundary  because  no  account  is  taken  of  the 
actual  lengths  of  the  links  (1  or  /?).  This  variation 
in  link  length,  together  with  the  initial  chain- 
generating quantization  and  the  node-quantization  of 
the  centroid  mentioned  earlier  are  the  chief  sources 
of  noise.  Unless  held  to  a low  value  by  using  a 
sufficiently  fine  grid  size,  the  effect  of  this  noise 
will  be  to  reduce  the  quality  of  shape  matching. 

Some  good  "feel"  for  adequate  quantization  can  be 
gotten  by  selecting  a blob  that  is  representative  of 
those  of  Interest,  quantizing  it  for  tvo  different 
orientations,  computing  the  corresponding  centroidal 
profiles,  and  then  comparing  these.  If  the  quality 
of  match  ia  satisfactory,  the  quantization  fineness 
is  likely  to  be  adequate. 

The  precision  of  the  centroldal  profile  can  be 
Improved  - though  at  considerable  increase  in  process- 
ing cost  - by  plotting  the  distance  from  centroid  to 
chain  node  against  the  angle  this  vector  makes  with 
the  x axis,  and  using  the  Euclidean  distance  from  the 


b 


I 


true  centroid  (rather  than  free  the  node  nearest  the 
centroid).  The  *sme  origin  shift  and  vertical  normal- 
isation are  then  applied.  The  abscissa  range  now 
will , of  course,  be  precisely  equal  to  360°  and  no 
horizontal  normalization  is  necessary.  Although  rela- 
tively simple  algorithms  exist  for  calculating  the 
Euclidean  distances  between  points  and  the  angles 

relative  to  the  x axis10,  indications  thus  far  are 
that  these  refinements  are  not  worth  the  extra 
effort. 
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Fig.  2.  Four  kinds  of  maps,  (a)  blob  map,  (b)  contour 
map,  (c)  flow  map,  (d)  region  map. 
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A key  element  in  pattern  recognition  is  the  des- 
cription of  shape.  For  tvo-dimensional  objects 
(blobs),  shape  is  conveyed  by  the  curving  of  the  bound- 
ary line  and  is  normally  considered  independent  of 
scale  and  orientation.  The  curving  may  be  regarded  as 
a concatentation  of  arcs  of  varying  instantaneous  radii 
of  curvature,  possibly  interspersed  occasionally  by 
discontinuities.  The  description  of  shape  is  facili- 
tated by  segmenting  the  boundary  line  at  so-called 
critical  points  - corners  (discontinuities  in  curva- 
ture), points  of  inflection,  and  curvature  maxima. 
Additional  critical  points  are  intersections  and 
points  of  tangency.  Algorithms  are  described  for  ex- 
tracting such  critical  points  in  the  presence  of  noise. 
An  illustration  is  given  shoving  hov  the  critical 
points  may  be  used  in  the  development  of  a shape  des- 
cription system. 

1.  Introduction 

The  recognition  of  tvo-dimensional  spatial  pat- 
terns is  intimately  linked  vith  the  classification  of 
shape.  Shape  is  the  composite  effect  of  curvature;  it 
is  a property  of  a curve,  independent  of  scale  and 
orientation,  and  refers  to  the  manner  in  which  the 
curve  deviates  from  some  norm,  such  as  a straight  line 
or  a circle.  We  shall  use  the  terms  "line"  and  "curve" 
interchangeably;  a line  is  not  presumed  to  be  straight 
unless  explicitly  so  identified.  Neither  must  a curve 
be  non-straight.  The  kinds  of  curve  topologies  of 
interest  to  us  are  shovn  in  Fig.  1. 

A closed  curve  traces  a single  path  vithout  end. 

An  open  curve  has  tvo  distinct  end  points.  A closed 
curve  is  alvays  considered  to  be  simply  connected;  a 
multiply-connected  curve  is  regarded  as  a set  of  tvo 
or  more  distinct,  closed  curves,  one  of  vhlch  encloses 
all  others.  3oth  open  and  closed  curves  may  intersect 
other  open  or  closed  curves,  as  veil  as  themselves. 

A closed  curve  has  a definite  sense  of  direction, 
vhlch  is  such  as  to  place  the  interior  tovard  the 
right.  For  an  open  curve,  the  direction  is  arbi- 
trary. 

The  points  at  vhlch  curves  intersect  or  touch 
vl 11  be  called  Junctions.  Junctions  are  characterized 
by  their  radlality,  vhlch  is  the  count  of  the  number 
of  lines  entering  the  Junction.  We  speak  of  trlradlal, 
quadradial,  quintradlal,  etc.  Junctions.  Except  in 
very  rare  cases,  pictures  of  natural  processes  exhibit 
only  trlradlal  Junctions.  The  existence  of  a quad- 
radial  Junction  is  almost  alvays  evidence  of  a man- 
made feature.  Junctions  of  radlality  greater  than 
four  are  most  rare. 

At  a Junction  of  radlality  n,  pairs  of  adjacent 
curves  vlll  form  n distinct,  non-crossing  paths 
through  the  Junction.  At  least  n-2  of  these  paths 
must  exhibit  a slope  discontinuity  (corner)  at  the 
Junction. 

We  shall  regard  all  tvo-dimensional  patterns  as 
occurring  in  the  form  of  maps.  Maps  are  special  kinds 
of  line  drawings  that  occur  in  the  following  four 

• This  vork  was  supported  by  the  Air  Force  Office  of 
Scientific  Research,  Directorate  of  Mat hmaat leal  and 
Information  Sciences,  under  Orant  AFOSR  76-293T. 


types  - blob  maps,  contour  maps,  flow  maps,  and  region 
maps.  The  four  types  are  illustrated  in  Fig.  2. 
Different  types  may  be  overlaid  to  form  composite  maps. 

Blob  maps  are  maps  containing  only  closed  curves, 
none  of  which  may  contain  another  closed  curve.  Some 
of  the  curves  may  be  closed  via  the  map  boundary.  The 
closed  curves  (vhich  each  bound  a blob)  may  touch  but 
not  intersect  each  other.  There  are  no  end  points. 

Contour  maps  are  like  blob  maps  except  that  one 
closed  curve  may  contain  one  or  more  other  closed 
curves.  Tvo  curves  may  be  tangent  to  each  other  pro- 
vided one  of  the  curves  is  contained  within  the  other. 
There  are  no  end  points. 

Flow  maps  contain  only  open  curves  and  these  are 
connected  to  form  one  or  more  tree  structures.  There 
are  no  loops.  All  curves  terminate  either  at  Junc- 
tions or  at  end  points.  For  each  connected  tree 
structure,  precisely  one  end  point  coincides  vith  the 
map  boundary. 

In  region  maps,  the  space  is  divided  into  mutu- 
ally exclusive  areas.  All  lines  begin  and  terminate 
at  Junctions  or  are  closed  (rare).  There  are  no  end 
points. 

We  identify  three  kinds  of  problem  tasks  that 
involve  shape  - "matching",  "fitting",  and  "classify- 
ing". In  "matching"  ve  desire  to  establish  equiva- 
lence (up  to  a certain  level  of  precision)  between 
tvo  curves  vhich  may  be  open  or  closed.  The  equiva- 
lence may  be  on  the  basis  of  (l)  shape  alone, 

(2)  shape  and  scale,  (3)  shape  and  orientation,  or 
(1)  shape,  scale,  and  orientation. 

In  "fitting"  ve  must  establish  that  tvo  open 
curves  (vhich  are  usually  parts  of  tvo  different 
closed  curves)  are  equivalent  in  scale  and  complement- 
ary in  shape.  Tvo  curves  are  said  to  be  complementary 
in  shape  if  they  are  geometrically  similar  but  are 
traced  out  in  the  opposite  sense.  (The  direction  of 
tracing  is  determined  by  the  area  bounded,  vhich  by 
convention,  is  alvays  assumed  to  lie  tovard  the 
right . ) 

"Classifying"  refers  to  the  process  of  assigning 
given  curves  to  predetermined  categories.  The  assign- 
ment may  be  based  solely  on  shape , or  it  may  involve 
scale  and  orientation  as  veil. 

2.  Shape  Description 


One  of  the  difficulties  in  any  shape  processing 
task  is  the  lack  of  a definitive  vay  of  describing 
shape.  Many  investigators  have  devoted  attention  to 
this  task  and  a variety  of  approaches  have  been 

proposed1”"^.  We  shall  follov  a method  originally  pro- 
posed by  Attneave  and  Arnoult  and  divide  a curve  into 
segments  and  then  use  relatively  simple  features  for 
characterizing  the  segments.  Key  to  this  method  is  an 
effective  segmentation  scheme.  Later  ve  shall  also 
consider  a scheme,  applicable  to  closed  curves  only, 
that  does  not  require  segmentation. 

In  all  that  follows  it  vlll  be  assumed  that  the 
curves  have  been  quantized  into  a square  lattice  of 
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sufficient  fineness  to  preserve  the  desired  detail 
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and  then  encoded  in  terms  of  the  8-direction  chain 

. 2 
code  . 


We  shell  first  examine  the  shape  matching  problem. 
A tried  and  effective  technique  for  matching  tvo 
chains  (open  or  closed)  is  the  chain  correlation 
scheme,  so  called  because  of  its  analogy  to  conven- 
q 

tional  signal  correlation  . It  is  given  by 

®ab(J)“n  ^ co*<VVj,W/k 

vhere  the  a,  and  b(  are,  respectively,  the  ordered 

links  in  the  two  chains.  Chain  correlation  has  the 
drawback,  however,  of  being  rigidly  dependent  on  scale 
and  orientation.  To  use  it  for  matching  the  shape  of 
two  chains  of  different  scale  and  orientation,  one  of 
the  chains  must  first  be  rotated  and  scaled  to  bring 
it  into  alignment  with  the  other.  To  determine  the 
amounts  of  rotation  and  scaling  required,  seme  further 
information  about  the  two  chains  must  be  available. 

To  take  a very  simple  example,  suppose  two  open 
chains  are  to  be  checked  for  a shape  match,  independ- 
ently of  scale  and  orientation.  One  can  connect  the 
two  end  points  of  each  chain  with  a straight  line  seg- 
ment. The  relative  iengths  of  the  two  segments  then 
provides  a basis  for  adjusting  the  scale  of  one  chain 
to  that  of  the  other;  the  angle  between  the  two  seg- 
ments establishes  the  amount  of  rotation  required  to 
bring  the  two  chains  into  alignment.  (Rote  that  such 
scaling  and  rotating  introduces  additional  quantiza- 
tion noise.  ) After  the  scale  and  orientation  adjust- 
ment, the  two  chains  can  be  correlated  against  each 
other  to  determine  the  exact  degree  of  match. 

The  simple  scheme  Just  described  has  serious  defi- 
ciencies. Tirst  of  all,  it  will  not  work  for  closed 
curves.  Second,  the  ichae  is  overl;  dependent  on  the 
precise  locations  of  the  end  points.  Relatively 
slight  variations  here  will  cause  erroneous  scale  and 
orientation  normalizations,  and  result  in  failure  of 
the  correlation  teat , even  for  chain  pairs  that  are 
otherwise  very  similar  in  shape.  Finally,  the  scheme 
is  inapplicable  to  the  "fitting"  problem,  where  a 
section  of  one  chain  is  to  be  shown  to  be  of  comple- 
mentary shape  to  a section  of  another  chain  - without 
any  a priori  knowledge  of  orientation  (and,  possibly, 
scale  as  well ) . We  shall  now  develop  a more  powerful 
method  which  overcomes  these  difficulties. 

Even  if  we  expect  two  curves  to  match  in  their 
entirety,  it  would  be  unwise  to  test  them  for  match 
over  their  full  length  at  once.  Part  of  one  chain  may 
be  missing  or  severely  distorted  by  noise,  or  perhaps 
a part  may  have  been  compressed . Or  a chain  may  have 
been  bent,  placing  its  tvo  parts  at  a wrong  angle 
relative  to  each  other.  Although  such  effects  clearly 
reduce  the  quality  of  a match,  they  should  not  be 
permitted  to  overwhelm  evidence  of  good  matching  in 
other  parts  of  the  chains.  The  only  solution  to  this 
is  to  segment  the  chains,  and  to  proceed  with  the 
match  on  a section-by-section  basis . 

If  the  chains  to  be  matched  are  supposed  to  have 
the  same  scale  and  orientation,  and  cross-correlation 
is  to  be  used,  segmentation  presents  no  problem. 

Since  in  the  correlation  process  one  chain  (or  chain 
section)  is  anyway  shifted  relative  to  the  other, 
selection  of  the  segmentation  points  is  virtually 
arbitrary.  A totally  different  approach,  however,  is 
called  for  if  the  relative  scale  and  orientation  of 
the  two  chains  are  not  a priori  known. 


3.  Critical  Points 

If  we  wish  to  compare  two  chains  independently  of 
their  relative  scale  and  orientation,  we  must  find 
features  in  them  that  are  invariant  under  these  para- 
meter! , For  this  purpose  we  shall  call  upon  the  so- 
called  "critical  points"  of  the  chains. 

In  classical  geometry,  the  "critical  points"  of 
a curve  are  its  maxima,  minima,  and  points  of  inflec- 
tion. Our  interest  here  in-  curves  is  more  far- 
reaching,  and  we  shall  expand  the  concept  of  critical 
points  to  Include  also  discontinuities  in  curvature, 
end  points,  intersections  (Junctions),  and  points  of 
tangency.  To  a varying  degree,  these  points  are  all 
well-defined  and  their  character  is  unaffected  by  the 
transformations  of  scale  and  rotation. 

let  us  consider  the  two  curves  shown  in  Fig.  3. 
Visual  inspection  shows  them  to  be  of  apparent  similar 
shape,  though,  of  difference  size,  different  orienta- 
tion, and  different  sense  of  direction.  We  note  that 
each  curve  has  two  fairly  sharp  discontinuitiej  in 
curvature  (DICs).  If  the  two  curves  are  indeed  of  the 
same  shape  and  there  is  a pairwise  correspondence  be- 
tween the  DICs,  then  there  must  be  point-for-point 
shape  correspondence  in  the  sections  between  the  DICs. 
And,  of  course,  all  shape-related  features  that  are 
independent  of  scale  and  orientation  will  be  the  same 
for  both. 

We  first  connect  the  two  DICs  in  each  curve  with 
a straight  line  segment  and  determine  its  length. 
Suitable  features  to  be  computed  next  are  (1)  the 
length  of  the  curve  section  between  the  DICs,  (2)  the 
total  "bay"  area  lying  between  the  curve  section  and 
the  straight  line  segment  (the  "bays"  lie  toward  the 
left  of  the  curve),  (3)  the  total  "peninsula"  area 
(lying  toward  the  right  of  the  curve) , (U)  the  maximum 
"bay"  depth,  and  (5)  the  maximum  "peninsula"  depth. 

All  these  features  are,  of  course,  independent  of 
orientation.  To  make  them  also  Independent  of  scale, 
we  divide  each  linear  feature  by  the  segment  distance 
— ^ 

AB,  and  each  area  feature,  by  AE  . The  features  are 
illustrated  in  Fig.  3. 

Simple,  efficient  algorithms  exist  for  computing 
the  foregoing  features  if  the  curves  are  represented 
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in  the  form  of  chains  . Observe  that  if  for  two 
curve  sections  the  features  are  pairwise  in  close 
agreement,  then  we  have  strong  indication  that  the 
curves  are  of  the  same  shape;  though,  of  course,  a 
correlation  or  other  point-by-point  check  is  needed  to 
establish  this  for  certain.  Conversely,  any  major 
discrepancy  in  a feature  is  indicative  of  a difference 
in  shape.  If  for  two  chains  all  features  match  except 
that  the  bay  features  of  one  match  to  the  peninsula 
features  of  the  other,  then  the  two  chains  are  likely 
to  be  of  complementary  shape  and  thus  potential  candi- 
dates for  fitting. 

In  the  foregoing  example  we  selected  DICs  to 
define  curve  sections  for  matching.  Of  all  the  criti- 
cal points,  DICs  (as  well  as  Junctions)  are  the  most 
sharply  and  most  reliably  defined.  End  points  are 
also  sharply  defined;  however,  their  location  is  less 
dependable  since  occlusion  of  part  of  a curve  will 
cause  a false  end  point  to  be  generated.  The  next 
best  critical  points  to  use  (if  DICs  and  Junctions  are 
not  available)  are  points  of  inflection  and  points  of 
maximum  curvature  (maxima  or  minima).  We  shall  dis- 
cuss these  further  in  the  next  section  in  connection 
with  the  determination  of  critical  points. 
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U.  Determination  of  Critical  Points 

Junction#  and  end  points  require  no  special 
effort  for  their  determination;  they  are  available 
directly  from  the  chain  data  structure.  Discontinui- 
ties in  curvature,  hovever,  must  be  extracted  by  com- 
putation. A scheme  that  has  been  found  to  be  effec- 
tive for  this  purpose  is  the  line  segment  scan~^ . 
described  as  follovs. 


precisely  s+1  links.  This  Is  readily  apparent  from 
an  Inspection  of  Fig.  4.  The  sum  of  the  &*  values 
over  the  s+1  steps  at  a discontinuity  is  given  by 
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We  consider  a chain  {a. }n,  a c{0,... ,7),  and 

1 s 

define  a straight  line  segment  that  connects  the 
the  initiua  of  ai  to  the  terminus  of  #1 . The 
length  of  L®  can  vary  from  a maximum  of  s>/?  to  a min- 
imum of  lj  hovever,  for  a veli-quantized  curve,  it 
should  never  be  leas  than  s/2.  The  actual  length  of 

lj  is,  of  course,  given  by 


**  - ((X?)2  ♦ (Y®)2] 
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vhere  X • J a 

J«i-s+l 

Yi  ■ 1 *iy 
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and  vhere  the  a^  and  a,^  are,  respectively,  the  x and 
y components  of  the  chain  links,  a^,  a^  e {-1,0,1}. 

The  angle  betveen  the  x axis  and  the  forvard 
direction  of  the  line  segment  is 
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8,  • tan  Y./X. 
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and  can  vary  from  0 to  360  degrees.  Of  particular 
interest  is  the  variation  in  this  angle  as  L®  scans 

over  the  chain.  Rote  that  for  a closed  chain, 

a,  ■ a.  We  define 
i 1-n 
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as  the  incremental  curvature  of  the  chain;  it  is 
equal  to  tvice  the  mean  over  tvo  successive  angular 
differences.  The  incremental  curvature  is  a smoothed 
measure  of  curvature;  the  greater  s,  the  heavier  the 
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smoothing.  For  a veil -quantized0  curve,  s will  range 
normally  from  a minimum  of  5 to  a maximum  of  about  13 
This  appears  to  be  the  optimum  range  for  removing  the 
effect  of  chain  quantization  noise  and  yet  preserving 
significant  fine  detail.  An  illustration  of  line- 
segment  scanning  is  given  in  Fig.  U. 


A plot  of  4*  for  a given  chain-encoded  curve  pro- 
vides insight  into  the  shape  of  the  curve.  The  plot 
is,  of  course.  Independent  of  the  orientation  of  the 
curve,  and,  if  the  abscissa  is  normalized,  .can  be  made 
independent  of  scale  as  veil.  For  a fairly  straight 

section  of  a curve,  4*  vill  hover  about  zero.  For  a 
gentle  curving  to  the  left  or  right,  4”  vill  maintain 
a small  positive  or  negative  value,  respectively.  At 
a point  of  inflection,  4*  vill  change  sign.  At  a 
significant  curvature  discontinuity,  4*  vill  take  on 

a succession  of  relatively  large  values,  all  of  the 
same  sign,  and  all  ccmpressed  into  a span  of 


and  is  equal  to  tvice  the  net  angular  change  at  the 
discontinuity. 

The  detection  of  curvature  discontinuity  in  the 
presence  of  noise  is,  of  course,  a samevhat  subjective 
process.  A chain  is  a special  kind  of  straight-line 
approximation  of  a curve  in  vhlch  all  segments  are  of 
lengths  1 or  /2,  and  all  angular  changes  are  multiples 
of  U;  degrees.  To  get  above  the  chain  quantization 
noise  level,  a value  of  s of  at  least  U is  required. 
Heavier  smoothing  (s>U)  is  desirable,  but  at  some 
increased  value,  valid  curvature  discontinuities  vill 
be  smoothed  over.  In  the  context  of  a chain  represen- 
tation (vhere  true  curvature  continuity  does  not 
exist),  a discontinuity  in  curvature  can  only  be  de- 
fined as  the  combination  of  three  concatenated 

sequences  of  6*  values  - tvo  during  vhich  df  varies 

about  a small  mean  value  (or  about  zero),  separated  by 
one  precisely  of  length  s+1  during  vhich  it  differs 
significantly  from  zero  and  accumulates  a substantial 

total  variation  t®.  In  Fig.  U(b),  this  condition  is 
satisfied  in  the  region  i»19  to  i«2U.  For  this 


equals  189.2,  indicating  a net  CCW  curvature  of  9^.6 
degrees.  The  curvature  discontinuity  is  clearly 
evident;  its  location  is  fixed  at  the  value  of  i Just 

preceding  the  first  significant  value  of  6®,  that  is, 
at  l»l8. 


In  general,  ve  shall  refer  to  the  prominence  of 
a curvature  discontinuity  as  its  "cornerity"  and 

assign  the  follovlng  quantitative  measure: 


Ki  ■ *T*2 


1+s+l 


vhere  t,  and  t^  are  the  lengths  of  the  4f  sequences, 
to  either  side  of  the  discontinuity,  over  vhich  4® 

remains  at  a small  mean  value. 


For  the  purpose  of  shape  matching  (or  fitting) 
using  scale-  and  orientation-invariant  features,  only 
discontinuities  vith  large  cornerity  values  should  be 
used.  Use  of  discontinuities  vith  ssiall  cornerity 
values  may  lead  to  erroneous  segmentation. 

Examination  of  Fig.  lU(b)  also  shovs  that  4^ 

changes  sign  at  i«12,  indicating  the  presence  of  a 
point  of  inflection.  The  actual  location  of  the 
point  of  inflection  is,  of  course,  s/2  positions 

earlier  since  4f  is  labelled  according  to  its  leading 

node.  In  the  example  shovn,  vith  s»5,  ve  place  the 
point  of  Inflection  at  i«10. 

If  the  point  of  Inflection  is  prominent,  that 
is,  if  there  is  a clear  and  abrupt  shift  of  the  radius 
of  curvature  fran  one  side  of  the  chain  to  the  other , 
the  preceding  method  vill  locate  it.  Rot  infrequent- 
ly, hovever,  a curve  changes  from  convex  to  concave 
curvature  in  a slov  and  meandering  fashion.  Then  ve 
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cannot  determine  the  point  of  Inflection  In  this  way. 
If  the  tvo  regions  of  opposing  curvature  (for  which  we 
are  seeking  the  point  of  Inflection)  are  both  pro- 
longed and  strong,  we  may  find  their  maxima.  Ve  then 
cannect  these  two  points  of  maximum  curvature  with  a 
straight  line  and  take  its  intersection  with  the 
chain,  lying  between  the  two  regions  of  curvature,  as 
the  point  of  inflection.  If  the  two  curvature  regions 
are  not  sufficiently  strong,  or  are  spaced  too  far 
apart,  the  point  of  inflection  cannot  be  reliably 
determined  and  we  must  seek  other  critical  points 
elsewhere. 


5.  Centroldal  Profile 

An  alternate  scheme  for  representing  the  shape  of 
a blob  in  an  orientation-  and  scale-invariant  manner 
is  provided  by  the  centroldal  profile.  It  is  appli- 
cable only  to  closed  curves  and  does  not  involve  any 
sefpsentation.  The  centroldal  profile  is  simply  a 
normalized  plot  of  the  distance  from  the  boundary  to 
the  centroid  of  the  blob,  as  a function  of  distance 
along  the  boundary.  Since  the  centroid's  coordinates 
are  determined  by  the  ratio  of  first-order  moments  to 
enclosed  area,  they  are  relatively  insensitive  to 
noise,  and  the  centroid  thus  tends  to  be  a stable 
point  of  reference  for  the  blob.  It  a blob  is  des- 
cribed in  the  form  of  a chain,  its  centroid  can  be 
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rapidly  catputed  by  means  of  a simple  algorithm  . 

To  obtain  the  centroldal  profile  for  a chain- 
coded  blob,  one  first  computes  the  location  of  the 
centroid.  Sext  one  determines  the  distances  from  the 
centroid  to  each  chain  node  in  sequence,  beginning 
with  the  lnitium  of  the  chain.  The  Euclidean  distance 
measure  could  be  used;  however,  a simpler  and  equally 
effective  measure  for  the  purpose  at  hand  is  provided 
by  the  residue-chain  length.  A residue  chain  between 
two  points  is  the  standardized  chain  of  minimum  length 

connecting  the  two  points10.  The  computation  of  resi- 
due-chain length  Involves  only  the  operations  of  addi- 
tion and  branching,  and  because  of  its  simplicity  is 
the  preferred  distance  measure  for  the  centroldal  pro- 
file. Its  use  does  require,  however,  that  the 
coordinates  of  the  centroid  be  quantized  to  the 
nearest  grid  node,  thereby  slightly  increasing  the 
quantization  noise  in  the  blob  description. 

For  a closed  chain,  the  choice  of  the  lnitium  is 
normally  arbitrary.  To  provide  some  uniformity  in  the 
selection  of  the  origin  of  the  centroldal  profile,  the 
lnitium  is  cyclically  shifted  to  the  chain  node  maxi- 
mally distant  from  the  centroid.  If  the  maximum  is 
not  unique,  any  of  the  may  be  chosen.  This 

shifting  of  the  lnitium  provides  same  orientation  in- 
variance for  the  centroldal  profile.  However,  if 
there  is  more  than  one  maximum,  there  is  an  ambiguity 
of  order  equal  to  the  number  of  maxlmally-dlstant 
chain  nodea.  This  is  only  a minor  drawback,  as  ve 
shall  see  shortly. 

The  dependence  on  scale  can  be  rmaoved  by  divid- 
ing all  profile  values  by  the  maximum  value.  The 
actual  maximum  value  will  be  called  the  vertical 
normalization  factor.  Every  centroldal  profile  thus 
will  have  a value  of  unity  at  the  origin  and  be  less 
than  or  equal  to  unity  elsewhere. 

One  additional  normalization  step  remains.  There 
is  no  fixed  rule  concerning  the  number  of  chain  linka 
that  are  needed  to  describe  a given  blob.  Ouidelinea 
exist  for  selecting  the  quantization  grid  fineness 
that  are  based  on  the  value  of  the  minimum  radius  of 
curvature  that  is  to  be  preserved  in  the  resulting 
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chain  ■ However,  the  selection  of  the  smallest  radius 


of  curvature  is  subjective,  and  a shape  comparison 
scheme  should  not  be  too  rigidly  dependent  on  such  a 
loose  criterion.  Accordingly  ve  must  also  normalize 
the  abscissa  of  the  centroldal  profile.  This  is  done 
by  setting  the  maximum  abscissa  value  - corresponding 
to  one  complete  trace  around  the  blob  contour  (or, 
equivalently,  one  pass  over  the  ccmplete  chain)  - 
arbitrarily  to  100.  All  other  values  are  then 
adjusted  in  proportion.  The  factor  required  to  yield 
the  uniform  abscissa  scale  vill  be  called  the  horizon- 
tal normalization  factor.  A centroldal  profile, 
together  with  the  blob  whose  shape  it  represents,  is 
shown  in  Fig.  5. 

For  the  blob  of  Fig.  5,  the  vector  from  the  cen- 
troid to  successive  chain  nodes  rotates  uniformly 
about  the  centroid  in  the  clockvlse  sense.  This  vill, 
of  course,  not  be  the  case  in  general.  To  avoid  pos- 
sible ambiguities  in  the  Interpretation  of  a centroid- 
al  profile,  ve  shall  take  profile  values  corresponding 
to  clockvlse  angular  rotation  as  being  positive,  and 
those  corresponding  to  counterclockwise  rotation,  as 
being  negative.  An  Illustration  of  this  is  provided 
by  Fig.  6. 

The  centroldal  profile  represents  the  blob  with- 
out regard  to  scale,  orientation,  and  (in  a limited 
sense)  quantization  fineness.  In  effect  only  the 
shape  remains.  Hote  that  the  profile  generation  pro- 
cess is  reversible;  if  the  origin  shift  and  the  verti- 
cal and  horizontal  normalization  factors  are  known,  it 
is  possible  to  reconstruct  the  blob  chain  from  its 
centroldal  profile. 

Two  blobs  may  be  compared  as  to  their  shape  by 
comparing  their  centroldal  profiles.  If  both  profiles 
have  a single,  veil-defined  maximum  (which,  of  course, 
will  be  at  the  origin),  the  comparison  is  straight- 
forward. Suitable  measures  of  shape  dissimilarity  are 
the  integral  of  the  differences  between  the  tvo  pro- 
file functions  and  the  integral  of  the  squared  differ- 
ences. 

If  one  or  both  of  the  profiles  have  a maximum 
that  is  not  well  defined,  or  have  multiple  maxima,  a 
better  means  for  comparison  is  provided  by  convention- 
al signal  cross-correlation.  This  vill  yield  a meas- 
ure of  the  similarity  as  well  as  identify  possible 
errors  in  the  origin  selection. 

It  should  be  pointed  out  that  the  abscissa  of  the 
centroldal  profile  represents  the  (normalized)  link- 
index  parameter  1 of  the  boundary  chain.  It  does  not 
represent  uniform  angular  rotation  about  the  centroid; 
nor,  in  fact,  is  it  a precise  measure  of  distance 
along  the  boundary  because  no  account  is  taken  of  the 
actual  lengths  of  the  links  (1  or  </?) . This  variation 
in  link  length,  together  with  the  initial  chain- 
generating quantization  and  the  node-quantization  of 
the  centroid  mentioned  earlier  are  the  chief  sources 
of  noise.  Unless  held  to  a low  value  by  using  a 
sufficiently  fine  grid  size,  the  effect  of  this  noise 
will  be  to  reduce  the  quality  of  shape  matching. 

Seme  good  "feel"  for  adequate  quantization  can  be 
gotten  by  selecting  a blob  that  is  repreaentative  of 
those  of  interest,  quantizing  it  for  tvo  different 
orientations,  computing  the  corresponding  centroldal 
profiles,  and  then  comparing  these.  If  the  quality 
of  s^tch  is  satisfactory,  the  quantization  fineness 
is  likely  to  be  adequate. 

The  precision  of  the  centroldal  profile  can  be 
improved  - though  at  considerable  increaae  in  process- 
ing cost  - by  plotting  the  distance  from  centroid  to 
chain  node  against  the  angle  this  vector  makes  with 
the  x axis,  and  using  the  Euclidean  distance  from  the 
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tru«  centroid  (rather  than  from  the  node  nearest  the 
centroid).  The  same  origin  shift  and  vertical  normal- 
ization are  then  applied.  The  abscissa  range  nov 
will,  of  course,  be  precisely  equal  to  360°  and  no 
horizontal  normalization  Is  necessary.  Although  rela- 
tively simple  algorithms  exist  for  calculating  the 
Euclidean  distances  betveen  points  and  the  angles 

relative  to  the  x axis11'1,  indications  thus  far  are 
that  these  refinements  are  not  varth  the  extra 
effort . 
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